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Abstract. We investigate the entanglement within a system undergoing a random, 
local process. We find that there is initially a phase of very fast generation and spread 
of entanglement. At the end of this phase the entanglement is typically maximal. In 
5^ ■ PQ we proved that the maximal entanglement is reached to a fixed arbitrary accuracy 

within 0(N 3 ) steps, where N is the total number of qubits. Here we provide a detailed 
and more pedagogical proof. We demonstrate that one can use the so-called stabilizer 
gates to simulate this process efficiently on a classical computer. Furthermore, we 
discuss three ways of identifying the transition from the phase of rapid spread of 
entanglement to the stationary phase: (i) the time when saturation of the maximal 
entanglement is achieved, (ii) the cut-off moment, when the entanglement probability 
distribution is practically stationary, and (iii) the moment block entanglement scales 
exhibits volume scaling. We furthermore investigate the mixed state and multipartite 
setting. Numerically we find that classical and quantum correlations appear to behave 
similarly and that there is a well-behaved phase-space flow of entanglement properties 
towards an equilibrium, We describe how the emergence of typical entanglement can 
be used to create a much simpler tripartite entanglement description. The results form 
a bridge between certain abstract results concerning typical (also known as generic) 
entanglement relative to an unbiased distribution on pure states and the more physical 
picture of distributions emerging from random local interactions. 
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1. Introduction 

Entanglement is a key resource in quantum information tasks and therefore the 
exploration of the structure of entanglement is of important concern in quantum 
information science j2]. Our quantitative understanding of this resource is very strong 
for bipartite entanglement; for reviews see [2] El 0] El El E] , or refer to [8] for an 
introduction to quantum information tasks. However multipartite entanglement [21 [9] is 
much less well understood. In particular there appears to be a plethora of inequivalent 
classes of multipartite entanglement that are locally inequivalent [TO T ITT1 [12 ], [T3l \T$\ [T5] . 
There is hope that one can cut down on this plethora by considering which classes 
are typical (generic) relative to a certain measure on the set of states known as 
the unitarily invariant (Haar) measure [16j. In that measure, practically all pure 
states of large numbers of spins are maximally entangled [T6], [TT] HEl UHl ED]. This 
simplification suggests that the investigation of generic entanglement may hold some 
promise. However, an important question mark has existed as to whether this measure 
is physical, in the sense that it can be approximated to arbitrary precision by two- 
particle interactions in a time that grows polynomially in the size of the system. The 
question has been raised in one form or another in [U Ell [221 E21 EI] • 

Our key objective is to determine whether this is the case. We find and prove that 
it is indeed possible to obtain generic entanglement properties in a polynomial number 
of steps in a two-party random process and give an explicit way of doing it. We also 
aim to gain a deeper understanding of the nature of the approach to the regime where 
entanglement displays generic behaviour. The paper both expands on the results of [1] 
and provides several new results. 

The outline of the paper is as follows: We firstly discuss the key process that is 
used throughout this work: random two qubit interactions. These are modelled as two- 
qubit gates on a quantum computer picked at random. We then give the results of 
the present work. Firstly we prove that the generic entanglement as well as purity is 
achieved efficiently We additionally prove that one can use the so-called stabilizer gates 
to simulate this process efficiently on a classical computer in the sense that the same 
averages will be achieved for relevant quantities. We then discuss more in depth the 
observation that there is initially a phase of rapid spread of entanglement followed by 
a phase where the system is suffused with entanglement and the average entanglement 
across any bipartite cut is practically maximal. We discuss three ways of identifying 
the transition between these two phase: (i) the moment of saturation of the average 
entanglement, (ii) the cut off moment, and (iii) the moment the entanglement scales as 
the volume of the smaller of the two parties. We furthermore investigate the mixed state 
and multipartite setting. We find numerically that the classical correlations appear to 
behave similarly and that there is a well-behaved phase-space flow to the attracting 
equilibrium entanglement, We describe how the emergence of typical entanglement can 
be used to create a much simpler tripartite entanglement description. Finally we give a 
conclusion as well as an extensive discussion of the future of this line of enquiry. 
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The results form a bridge between certain abstract results concerning typical (also 
known as generic) entanglement relative to an unbiased distribution on pure states with 
the more physical picture of entanglement properties relative to distributions obtained 
by random local interactions. 

2. Random two-party interactions 

Interactions in nature tend to be local two-party interactions. In the setting of qubits, 
that corresponds to two-qubit unitary gates. To have a concrete physical process in 
mind, consider a quantum computer which can perform arbitrary single qubit gates and 
CNOT [51] gates between any two qubits in the register. Let the randomisation process 
be the application of random gates on randomly chosen qubits. This process leads to a 
distribution of pure states that will evolve over time gradually, and after a long time, 
approaching the flat distribution. 

2.1. The random walk 

We shall discuss the evolution of states \^)q in an N-qubit Hilbert space Q = 
Qi <8> • • • <S> Qn under a series of randomly chosen mappings. Each mapping is picked 
independently as in figure [TJ 
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Figure 1. Two-qubit random interactions. This shows steps (hi) and (iv) of the 
random walk. For an explanation of such circuit diagrams see [8] . 



(i) Choose U and V independently from Haar measure on U(2) (see appendix A for an 
introduction to the Haar measure). 

(ii) Choose a pair of distinct qubits c and d, uniformly amongst all such pairs. 

(iii) Apply U E U{2) to qubit c and V G 17(2) to d. 
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(iv) Apply a CNOT with target qubit d and control qubit c. 

2.2. Converges to uniform distribution 

The Markov Process described above converges to the uniform distribution on pure 
states (the distribution is described in Appendix A) because the CNOT and arbitrary- 
single qubit unitaries are universal and the only probability measure invariant under all 
unitaries is the Haar measure. In the limit of infinitely many steps we have lost all bias 
towards being close to the initial state. In this setting any pure state is equally likely 
so we have the unitarily invariant distribution. 

It is important to note that the convergence rate to the final distribution is 
exponentially slow in the number of qubits since approximating an arbitrary unitary 
to a fixed precision using a set of fixed-size gates requires a number of steps that grows 
exponentially in the number of qubits [8]. This leads one to question whether it is 
physically relevant to make statements relative to the unitarily invariant distribution. 
Interactions in nature tend to be two-body interactions and should therefore not get 
close to the unitarily invariant distribution in a feasible time, i.e. a time that scales 
polynomially in the total number of qubits. 

2.3. Asymptotic Entanglement Distributions 

Consider the entanglement within a system undergoing the type of randomisation 
described above. We first give some facts about the asymptotic entanglement probability 
distributions, and then give the key results of this work, which concern the rate and 
nature of approach to these asymptotic distributions. 

When states are picked from the unitarily invariant measure there is an associated 
probability distribution of entanglement of a block of spins. Some of the first studies 
of this were [XTf [T8] and the explicit solution for the average entropy of entanglement 
('Page's conjecture') was conjectured in [19] and proven in [20]. It is given by 



with the convention that Na < Nb and where Na + Nb = N, the total number of 
particles. 

This can be used to show that the average entanglement is very nearly maximal, i.e. 
close to min(NA, Nb), for large quantum systems where N 1. It is also interesting to 
note that there is a bound on the concentration of the distribution about this average. 
The probability that a randomly chosen state will have an entanglement E that deviates 
by more than S from the mean value E [S"(pa)] decreases exponentially with <5 2 [16j. 
Therefore one is overwhelmingly likely to find a near-maximally entangled state if the 
system is large. 




(1) 
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3. Theorem 1: Maximal average is achieved efficiently 

Despite the distribution on states requiring a number of steps that grows exponentially in 
the size of the system, we will now prove that achieving the entanglement distribution of 
generic quantum states to within any precision only requires a number of steps growing 
polynomially in the size of the system. An intuition why this is possible can be gained 
by noting that many states have the same entanglement, so attaching an entanglement 
value to all states results in a 'coarse-grained' state space which can be sampled in fewer 
steps. We now state our main 

Theorem 1. Suppose that Nb — Na = t > and some e G (0, 1) is given. Then if the 
number of steps n satisfies 

^ > 9iV(iV- l)[(21n2)AT + ln£- 1 ] 



we have 
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Notice that the second statement of the Theorem is most relevant for t^>l. This 
is because maximal entanglement Na is not exactly achieved when Nb — Na = 0(1). 
A similar problem is present in the asymptotic case analysis of 



3.1. Guide to the proof of Theorem 1 

Firstly we simplify the problem by noting that the purity Tr(p 2 An ) bounds the 
entanglement very tightly in the regime of almost maximal entanglement. Purity is more 
convenient to deal with so we study the convergence rate of the purity to its asymptotic 
value. We expand the global density matrix in terms of elements of the Pauli group 
and track the time evolution of the coefficients. It will turn out that the computation 
of Trp 2 requires only the knowledge of the squares of some of these coefficients. It is 
then a key innovative step to realize that the relevant coefficients evolve according to a 
Markov Chain on a small state space which we give and prove explicitly. We then use 
relatively recent Markov Chain convergence rate analysis tools to determine how fast 
this chain converges to its stationary distribution. These arguments center around the 
size of the 'spectral gap' of the stochastic matrix P defining the Markov Chain, which 
simply means the difference between the second largest eigenvalue Ai and the eigenvalue 
1. This is because P k = 5diag(l, X\, X^.-^S' 1 for some matrix S so the A^ will define the 
most slowly decaying term and thus govern the distance to the stationary distribution. 

3.2. Using purity to bound entanglement 

Most of our mathematical work will be in estimating the quantity E [Tr(p^ n )j. More 
precisely, Theorem [T] follows from 
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Lemma 1. For alln, 

E [Tr(p% r 



2 Na -\- 2 Nb 



2^ + 1 



< 4^ exp 



An 



9N(N 



Before we prove Lemma 1 we demonstrate that Theorem 1 may be deduced from 
it. To see the implication, first notice that the Von-Neumann entropy S(pA, n ) is lower- 
bounded by the Renyi entropy 

S2{pA,n) = - \og 2 {Tr(p\ n )). 

By the concavity of log 2 , we have E [S 2 (pA,n)] > — log 2 (E [Tr(p An )~j), and plugging in 
anas suggested in the Theorem, 



-N A 



1 + 2~* 



+ 2-^e] >N A -\og 9 (l + 2- t +e)>N A - 
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E [S2{ P A,n)\ > ~ l0g 2 f 2 i + 2 _ v 

For the second statement, we note that by Uhlmann's Theorem, the expectation in the 
LHS of ([3]) is given by a fidelity of reduced density matrices. We can then use well-known 
relationships between distance measures on density matrices [H] to deduce 



E 



max (ij) n \^f) 
J*)Asmax. entangled 



E[l-F(I A /2 NA ,p Ajn )] 



< JE[[l-F(I A /2»A,p An )}< 



< JE[\\I a /2 n a - pA 
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(4) 

(5) 
(6) 

< v / 2E[ J D(p A , n ||/ A /2^)]. (7) 

Here D(a\\p) := Tr [a log 2 cr — crlog 2 p] is the relative entropy distance, which in this 
particular case reads 

D(p An \\I A /2 NA ) =N A -S( PA ,n). 
Using the same argument as above, 

D(p An \\ I A /2 N *) <N A -E [S 2 (p An )) <N A + log 2 (E [Tr[p\ n )}) 



<N A + log 2 



2 Na -\- 2 Nb 
2 N + 1 



+ 8^ exp 



An 



9N(N — 1) 



and Theorem 1 follows. 

We are now proceeding with the proof of Lemma 1. The explanation of the first 
main ingredient will require some basic tools from linear algebra. 



3.2.1. Linear algebra in the space of Hermitian operators The proof of Lemma[TJtakes 
an indirect route that requires a quick detour into linear algebra. 

Let us make the following conventions: V[c] represents an operator V acting on 
qubit c. Denote the Pauli-operators (defined in appendix C) by Oi for {i = 1,2,3} and 
use o" = 1. For a string p = p 1 . . . p N e {0, x, y, z} N , 

N 

E P = 2~ N /^a P M 
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is a tensor product of Pauli operators, normalized so that Tr[S^] = 1. It is well-known 
that the operators S p form an orthonormal basis of the real vector space of Hermitian 
matrices over N qubits, with the inner product between A and B given by Tr(AB). 
Therefore, if we write H = YlpHp)^p for a Hermitian operator H, we have 

Tr[H 2 ] = J2Kp) 2 - 



Let us also note that if B C [1, . . . , N] is a non-empty subset of qubits and A 
[1, . . . , N]\B 7^ 0, we can express the tracing out of B in the following form 

Tr B [H] = £ h(p)Tr B [Z p ] 



pe{0,x,y,z} N 

E Hp) 

p€{0,x,y,z} N 



= 2 l B l/2 



E 



2\B\/2 J " \ 2l A l/ 2 



Hp)- 



p€{0,x,y,z} N :VjeB, Pj =0 



2l^l/ 2 



3.2.2. Back to our problem Let us now discuss how to apply the above to our problem. 
Assume that we write 

\^n)(^n\= E £n(p)^(p), 
pe{0,x,y,z} N 

where the ^ n (p)'s are real coefficients. Then 

2-1 



Tr{p\ n ) = Tr 

= 2 N E 



2 \B\/2 



E 



Up) 



<8> i eA (7 pM 

21^1/2 



(8) 



pe{0,x,y,z} N :\/j<£A Pj =0 

E tnipf. 

P e{0,x,y,z} N :VjgA, Pj =0 

Thus it suffices to determine the evolution of the positive coefficients £„(p) 2 . Our key 
idea is to map this evolution to a classical Markov chain and use a rapid mixing analysis 
to understand this evolution. 



3. 3. Evolution of the coefficients 

The main and final goal of this section is to map the evolution of the coefficients 
i 2 n {p) under the random applications of quantum gates onto a random walk over states 
p E {0, x : y, z} N . 

To achieve this end, several preliminary calculations are necessary. However, we 
will only use the following assumption about U and V: 

Requirement 1. At each step of the process, U and V are independently chosen 
from a measure on U (2) such that, if T is distributed according to the same measure, 
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F = F(A, B) is a bi-linear function of A and B, and a, b G {0, x, y, z}, 

f HI, I), a = 6 = 0; 

E[F(T<T a Tt,Tofl*)] = l lE.e^^K,^), a = b?0 

I 0, otherwise. 

It is shown in Section O that Haar measure on U(2) does have the above properties, 
and in section Section 0] that the set of single qubit 'stabilizer gates' also does. 

3.3.1. Basic aspects Now suppose we are given \ipn)AB and choices of c, t, U and V. 
Then \ip n +i) = I[i,...,N]\{c,t} ® W n |V>n), where 

W n = CNOT[c,t}(U[c\ ® V[t]). 

Therefore, 

I VWlKVWl I = 5^ £n(<?)(^l,...,7V]\{c,i} ® W / n)£< Z (^[l,... > 2V]\{ C ,t} ® W / rl ) t 
qe{0,x,y,z} N 

and for any p G {0, x, y, z}^ we find 

£*+l(p) = tr [ S (p)l^+l)(VV t +l|] 

i £ en(g)Tr[(a Pc [c]a Pt [t])^ n (a 9c [c]a 9t [t])iyt] (9) 



4 

g:Vj^{c,t}iji=pi 



and 

f ? , , (n) = 

16 



&-i(p) = 4 E U^a^fe g, <?') (10) 

<?,<?':Vi£{c,t}<?i=^=pi 

where 



G„(p >ff) gO = ^ r [Mc}cr P At})W n (a qc [c]a qt [t})Wl] Tr [{o- pc [c]^MWn{^ c [c]<Trf t [t])W$\ . 

The above expression would appear to suggest that £ 2 (p) depends on the non-positive 
£n> which would prevent the formulation of a Markov process for £ 2 . However, we are 
only interested in averages and this will be the key for a further simplification. Let us 
consider the expectation of G n (p, q, q') conditioned on the values of c, t and ip n . Let us 
notice first that G n (p, q, q') can be rewritten as 

G n (p, q, q') = Tr[a Pc Ua qc U^]Tr[a Pt Va qt V^]Tr[ap c Ua qlc U^Tr^Va^] (11) 

where (p c ,Pt) is the unique pair in {0,x, y,z} 2 such that 

a §c [c]a Pt [t] = ±CNOT[c,t](a Pe [c]a pt [t])CNOT[c,t] (see Tabled). (12) 

Thus G n (p, q, q') is, for fixed V, a bilinear function of Ua qc W and Ua^W; and for fixed 
U, a bilinear function of Va qt V^ and Va q ' V\ Because of Requirement [H we can deduce 
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00 <- 


-> 00 


xO 


<— > 


XX 


yO <- 


-> yx 


zO 


<— > 


zO 


Ox <- 


-> Ox 


Oy 


<— > 




Oz <- 


■* 


XZ 


<— » 


yy 


yz <- 


-> xy 


zx 


<— > 


zx 



Table 1. The map p c p t <-> p c Pt as defined by eq. ([T2|). 



that E [Cr n (p, g, g') | ^n, c, t] =0 unless q c = q' c and g t = q' t , i.e. g = q'. Moreover, in this 
case we have 



E[G n (p,q,q) \ ip n ,c,t] 



1, Pc=Pt = q c = qt = 0; 

§, Pc = g c = 0,pt 7^ 0,q t ^ 0; 

§, ]3 C ^ 0,q c ^ 0,p t = q t = 0; 

§, p c ^ 0,g c ^ 0,pt ^ 0,q t ^ 0; 

otherwise. 



(13) 



It follows that 



E [C+i(p) I iM] 



£n(Pc<-0,t^o)? 
3 ^2w£{x,y,z} £.n(.Pc+-0,t^w) j 
{z,y,z} £n(Pc<-™,t<-o), 



Pi 



0; 



Pc = 0,pt 7^ 0; 
Pc 7^ 0,pt = 0; 



(14) 



9 Ylw,w'G{x,y,z} £n{Pc<-w,t+-w>), Pc 7^ ®,Pt 7^ 0) 



9 ^iw,w'£{x,y,z} ^nyrct-wj^-w' 

where p c ^ W)t ^w> is the string that equals p with p c and p t replaced by w and w', 
respectively. Thus we may now determine a Markov Chain on the coefficients £ 2 Qo). 

3.3.2. The mapping to a Markov Chain We now notice the following key facts. First, 
{in(.<l)}q£{o,x,y,z} N is a probability distribution over {0,x, y,z} N because 



Second, consider the following way to generate a random p G {0, x, z}^ from an 
element q = q\ . . . q^ G {0, x, y, z}^. 

(i) choose a pair (c, t) G [l,...,iV] 2 of distinct elements in [1,...,N], uniformly 
amongst all such pairs; 

(ii) if q c = 0, set w c = 0; else, choose w c G {x, y, z} uniformly at random; 

(iii) if q t = 0, set w t = 0; else, choose w t G {x, y,z} uniformly at random; 

(iv) set p = q c ^wc,t^w t according to the mapping eq. (1T2]) 

We claim that if q is distributed according to {£^(g)} 9 , the distribution of p is given 
by {E[£ n+1 (p) | ip n ]} p . In fact, fix c, t. The "hat map" w c w t w c w t is self-inverse, 
hence the above choice of p corresponds to setting p c p t = w c w t and pi = qi for all 
ie [1, . . . , iV]\{c, t}. Direct inspection reveals that the probability of obtaining p (given 
c,t) is given precisely by (fill) , which (by averaging over c,t) proves the claim. 
We have shown that 
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Lemma 2. Assume that P is a Markov Chain on {0, x, y, z} N where the transitions 
from a state q G {0,x,y, z} N are described by the above random choices. Start this 
chain from go distributed according to {£o(<?)}q- Then the distribution q n of the state of 
the chain at time n satisfies 

Vp G {0, x, y, z} N , F(q n =p) = J2 P n (P, ?)£o(p) = E [UpY I M ■ 

As a result, if Z A = {p G {0,x,y,z} N : Vi G [1, . . . , N]\A, p t = 0}, then (by $)) 

E[Tr\pl n }\i,o]=2 NB V(q n eZ A ) = 2 N » ^ £(p). (15) 

p:{N n CA} 

3. 4- Analysis of the Markov chain and proof of the main result 

Now we need to analyse the Markov Chain that we have defined above with respect to 
its convergence properties. To this end we further simplify the problem by relating it to 
a simpler Markov Chain using some standard techniques that will be outlined below. 

3.4-1- Reduction of the chain Define: 

X(q)={ie[l,...,N) : qi ^0}. (16) 
One can easily show the following: 

Proposition 1. For all n, all E C [1, . . . , N], all q with X(q n ) = E, all e G E, d E, 

if go, gi, <?2, • • • is an evolution of the Markov Chain P, 

F(X(q n+1 )) = EU{d} \q n = q 



F{X{q n+1 )=E\{e} | q n = q 



where \E\ is the cardinality of set E. 

Proof. Assume that we are given q n = q, X(q n ) = E and consider the random procedure 
for P described in the previous section. If q c = q t = 0, then p = q. If q c = but q t ^ 0, 
then w c = and w t is uniformly chosen from {x,y,z}; hence p c p t = w c w t is uniform 
from {Ox, zz, zy}; i.e. X(q) is replaced by X(q) U {c} with probability 2/3 and remains 
the same with probability 1/3. Similarly, if q c ^ but qt = 0, t is added to X(q) with 
probability 2/3 and stays the same otherwise. Finally, if we have q c ^ 0, q t ^ 0, w c w t is 
chosen uniformly from {x, y, z} 2 , hence p c pt is uniform over 



2\E\ 



N 
2 

2(|^| - 1) 



9 



N 
2 



{y0, Oz, yz, xy, xO, Oy, xz, yy, zx}. 
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Thus in this case there are three possibilities: c (and c alone) is removed from X(q) 
(probability 2/9); t (and t alone) is removed from X(q) (probability 2/9); or nothing 
happens (probability 5/9). We deduce from the above that an element d G [1, . . . , N]\E 
can be added to E only if it is one of {c, t} and the remaining element in {c, t} belongs 
to X(q) = E. For each d there are 2\E\ such pairs, out of all possible N(N — 1); and if 
such a pair is chosen, d is added with probability 2/3. On the other hand, an element 
e G E can be removed only if it is one of {c, t} and the remaining element is in E, in 
which case (corresponding to 2(\E\ — 1) out of N(N — 1) pairs), e is actually removed 
with probability 2/9. These assertions imply the proposition. □ 

By Proposition 1, {X n = X(q n )} n is a Markov Chain. Since the only event we are 
interested in is {q n G Z A } = {X n C A} (see eq. (1T51) ). we may restrict our attention to 
this "reduced" chain. For convenience, we state this as a proposition. 

Proposition 2. Let {X n } n be the Markov Chain defined according to P and Proposition 
U\ above, started from a X = X(q ) with qo distributed according to the distribution 
{£o(g) } q given by a state \ipo) (cf. Lemma H)). Then 

E [ Tr [p%n) I V'o] = 2 JVb P (X n C A) . 
From now on, we will only deal with the "reduced" chain X n . 



3.4-2. Dealing with the isolated state It should be clear that X n as defined above is not 
ergodic, as the state is isolated; i.e. there are no transitions to or from it from the 
rest of the state space. This corresponds to the fact that q = ... is an isolated state 
of the initial Markov Chain. 

However, this is not a problem, as we know that 



P(* n = 0)=P(* o = 0)=P(g o = O...O) = £o 2 (O...O) 



Tr[|^o)(^o|] 2 I 
2 1 



This means that we can neglect this state and restrict our calculations with X n to the 
state space f2 = 2' 1,,,,,Ar l\{0}, as we know the contribution of to the final result. 

We now wish to show that the restricted chain is ergodic, i.e. that it has a unique 
stationary distribution A4 for which 

ME, Fen, lim P (X n = F \ N = E) = M(F). 

n— >+oo 

To prove this, it suffices [JT] to show that the chain is irreducible and aperiodic. 
Irreducibility holds when there are sequences of valid transitions between any pair of 
states in Q, which can be easily checked in our case. Aperiodicity means that there is no 
way to split Q = Qi U S7 2 so that all transitions happen from a state in one of fli or fl 2 to 
a state in the other set. But this is implied by the fact that P (X n+ i = E\ X n = E) > 
for all E G Q. This proves ergodicity, as desired. 
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3-4-3. Stationary distribution We now prove that the X n chain on Vt is reversible 
and determine the stationary distribution M.. Reversibility means that the stationary 
distribution Ai on Q satisfies the detailed balance condition: for all distinct C,D e Q, 

M(C)¥ (Ai = D | N = C) = M(D)F {X x = C \ X = D) . (17) 

Since we know that the chain is ergodic, the existence of a M. satisfying the above 
equation implies that this M. is the unique stationary distribution of the X n process. 

We make the ansatz that M.(C) = f(\C\) is a function of \C\ only In the above 
comparison of C and D, we can assume wlog that \C\ — k and \D\ = k + 1 for some 
1 < k < N — 1. Then the reversibility condition becomes 

Ah Ah ^ k 

f(h) -Z: = f(k + 1) f(k + 1) = 3f(k) f(k) = — , 

M ; 3iV(iV-l) M ; 9iV(iV-l) M y M ; M ; Z' 

where Z is a normalizing factor determined by the condition Xlcen-^^) = ^> * -e- 

cen k=i \ / 

Thus 

3 |C| 

is the unique stationary distribution of the {X n } chain restricted to Q. 



3.4-4- Limits Recall from Lemma [2] that the quantity we wish to estimate is 
E \Tr(p\ n ) | ^o] — 2 Ar - B P (X n C A). Using ergodicity of the chain restricted to Q, we 
know that this quantity converges as n — > +oo to 

Jta i 2»-P(*cA)=2»-(i F +(iil=-i) £ A<(0) 



Thus (again using Lemma [2]) 

V^o, h m E [Tr(p 




2^ + 1 



This result can also be deduced directly from the convergence of \ip n ) to a uniformly 
random state asn-> +oo, together with known formulae for the expected purity. 
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3.4-5. Mixing of the reduced Markov Chain Our main goal in this section is to prove 
bounds on the mixing time of the Markov Chain given by X n . We will take an indirect 
route to do so. 

Lemma 3. The Markov chain given by X n has spectral gap > 4/9N(N — 1). 

Proof. Consider a chain B n on Q defined as follows. Assume B„ = B and choose a 
1 < j < iV uniformly at random. If j 6 B and \B\ > 1, set B n+ i = B\{j} with 
probability 1/3 and B n+ i = B with probability 2/3. If j B, set B n+ i = B U {j}. We 
claim that 

Claim 1. B n is reversible, ergodic and has the same stationary distribution M. as X n . 
Moreover, the spectral gap of B n is at least 1/3N. 

Before proving the claim, we show how it implies the lemma, i.e. the > 4/9iV(iV — 1) 
bound for the spectral gap of X n . This is possible via a comparison of Markov chains. 
By Theorem 2.14 in [40], it suffices to show that for all distinct C,D G Q 

M(C)¥{X 1 = D | X = C) > ^z\f { B i = D \ B = C) . (18) 

Indeed, because both X n and B n are reversible chains, both the LHS and RHS are 
symmetric in C, D. Therefore, in proving eq. ffl8l) we can assume that D = C U {j} for 
some j e [1, . . . , iV]\C. Then it is easy to see that 




To finish, we must prove Claim [TJ 

Proof. ( of C/azm[I])Reversibility of B n follows from the fact that if C 6 £1, D = CU{d} e 
fl with d $l C are given, 

M(C)F (B 1 = D\B = C) = = ^) = M(D)F {B x = C | B = D) . 

We use the path-coupling technique of Bubler and Dyer (see e.g. [4T] ) to prove ergodicity 
and the desired spectral gap bound for B n . For B,B' e Q, let d(B,B') = \BAB'\ be 
the Hamming distance between B and B'. Call B and B' adjacent if d(B, B') = 1. We 
will show that one can couple one-step evolutions B±, B[ started from adjacent B , B' Q 
so that 

E[d(B 1 ,B[)-d(B ,B' )] = -^ N . (19) 

This means (cf. reference j4T]) that for arbitrary B , B' , there is a coupling of Bi, . . . , B n 
and B[, . . . , B' n such that for all n > 0, 

2 



E [d(B n , B' n )\ < (l - 3^) d(B , B' ) <N(l 



3N 



(20) 
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From this it follows via a standard argument that the statistical distance/ between B n 
and B' n decays at an exponential rate of < (1 — 1/3N), which also implies the desired 
spectral gap bound. 

The coupling in (f!9|) is indeed very simple. Suppose we are given adjacent 
Bq and B' Q . Without loss of generality, we can assume that Bq = {!,..., k} and 
B' Q = {1, . . . , k + 1} for some 1 < k < N — 1. We choose a j G [1, . . . , N] uniformly at 
random and update B and B' in the following way: 

(i) If 1 < j < k, 

(a) if k > 2, set B[ = B' \{j} and Bi = B \{j} with probability 1/3 and do not 
change the states with probability 2/3; 

(b) else if k — 1 do this only for B[, leaving Bi = B always. 

(ii) else if j = k + 1, 

(a) set B\ = B[ = B' Q with probability 2/3 

(b) OR B 1 = B' , B[ = B with probability 1/3; 

(iii) else rfi + 2<j<N, set B 1 = B U {j}, B[ = B' U {j}. 

Clearly, B\ and B[ each have the right distribution. Moreover, the distance between 
them increases by one (with probability 1/3) in case 1.6, decreases by 1 in case 2. a, and 
remains the same in all other cases. It follows that 

E [d(Bi, B[) — d(B 07 B' )] = - A + E [l {fc=1>1 <j< fc} ] 

1 - 2 _ 1 

- 3iV = _ 3iV' 

i.e. eqns. and ([2D]) follow. □ 

( End of proof of Lemma IB) □ 



3.4-6. End of proof of the main theorem We have shown in the Introduction that the 
Theorem follows from Lemma [TJ Moreover, Proposition [2] shows that (omitting the 
initial state), 

E[Tr(pl n )]=2 N *W(X n cA), 
where N has distribution given by \tpo) in the manner described above. Notice that 

P(* n C A) =F(X = 0) +P(* ± $,X n C A) = 1 + (2JV ~ 1)P( 2 ^ CA| ^^ 0) . 
Using the ergodicity of X n restricted to Q and the limit formula in Section I3.4.4[ 

<2 NB \\X n -M\\ sd , 

where \\X n — A4\\ s d is the statistical distance between M. and the distribution of X n 
restricted to Q. However, we know that 

(i) the restriction of X n is ergodic and reversible; 



2 Nb ¥ (X n C A) 



2 Na + 2 
2^ + 1 
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(ii) its spectral gap is bounded below by A/9N(N — 1); 

(iii) for any E 6 Q, the probability that X n+ i = E given X n = E is (cf. Proposition [I]) 



P {X n+1 = E | X n = E) = 1 



E - 

ee[l,...,AT]\B g 



2|£| 



A r 
2 



eg_E 



2(|£| 



iV 
2 



1 - 



1 - 



2\E\{N-\E\) 2\E\{\E\-l) 



N 
2 



9 



N 
2 



2((iY-l)|E| -||E| 2 ) 



iV 
2 

A simple calculation shows that the numerator above is at most 3(A" — l) 2 /8 < 
N 



3/4 



and that P {X n+1 = E \ X n = E) > 1/4 always. 



It follows from standard Markov chain theory (eg. Corollary 1.15 in [30]) that all 
eigenvalues of the X n chain that are different from 1 lie between —3/4 + 1/4 = —1/2 
and 1 — 4/9N and that for any initial distribution of X 



\X n -M\\sd < 



9N(N-1) 



y/mm C enM{C) 

Since 2 Nb < 2 N , this implies that 

2 n a _)_ 2 Nb 



< 2 N exp 



An 



2^ + 1 

which finishes the proof of Theorem 1. 



< 4^ exp 



9A^(A^ - 1) 



An 

9N(N - 1) J ' 



4. Efficient simulation of process on classical computer 

We now show that the purity evolution during this random process, which corresponds to 
a randomly chosen quantum computer circuit, can be simulated efficiently on a classical 
computer in the sense that the same statistics can be obtained with a polynomial effort 
in N. The efficient simulation is achieved by noting that one can use stabilizer states, 
a discrete and finite subset of general quantum states which can be parameterised 
efficiently (for a brief introduction to stabilizer states see Appendix C). This is of interest 
since it may lead to methods to efficiently simulate quantum systems with a high degree 
of entanglement. 

Stabilizer states, despite having various restrictions, possess a rich entanglement 
structure exhibiting multi-partite entanglement [281 1201 HEH EH [32], [33] and may be 
used in the approximate description of ground states of Hamiltonians [33]. Here we 
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will use two general facts about stabilizer states. Firstly we note that for a single 
qubit there are six stabilizer states, given by the +1 eigenvectors of the Pauli operators 
oi, —0i, 02, —o~2, 03, —03- One can note that these are evenly distributed over the Bloch 
Sphere, so are similar in that regard to the unitarily invariant measure on U(2). Secondly 
we use the known fact that {H,S,CNOT} form a universal set for stabilizer gates, in that 
any gate that maps the set of stabilizer states onto and into itself can be decomposed 
as a combination of those. 

The entanglement probability distribution on stabilizer states is derived in [30]. In 
a system of iV spins where Na(Nb) is Alice's (Bob's) number of qubits the probability 
of finding that entanglement between A and B in a randomly chosen pure stabilizer 
state equals E is given by 

_ n£ (2- + 1) * _ i )(2 ^ _ 22,-1) 

YYk=N-N A +l\ z ' ^ l Jj=l 

where E is an integer. This can be used to show the average is nearly maximal and the 
distribution squeezes up around the average with increasing N. 

Considering the random walk at hand, we note that the expected purity of a flat 
distribution of stabilizer states is the same as that for the unitarily invariant measure 
on general states [27]. 

So the stabilizer random walk described earlier will asymptotically yield the same 
expected purity as the general state wall|j]- 

Each one-qubit stabilizer gate permutes the six states. The stabilizer gate invariant 
distribution is therefore that which is flat on these states. 

Lemma 4. Assume that T is drawn from a flat distribution of the 67 single qubit 
stabilizer gates. Then requirement 1 is satisfied. 

Proof. In the notation defined in Lemma [6] in appendix B, the six stabilizer states are 
given by (r x , r y , r z ) = (1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1). One can 
use the proof that Haar measure on U(2) satisfies requirement 1 which is provided in 
Appendix B, but replacing the Haar measure on U(2) with the flat distribution on the 
six stabilizer states. □ 

Requirement 1 is is the only requirement made about how we pick the single qubit 
gates U and V in the proof of Theorem 1. Accordingly, with regard to the evolution 
of the expected purity, the stabilizer random walk is equivalent on average to that on 
general states when concerned with the average purity of subsystems. 

Our result explains why averaging the entanglement evolutions of different stabilizer 
random walk realizations reproduces the behaviour of the corresponding general state 
two-party process. An example of this can be seen in the numerical simulations shown 
in figures [2] and El 

| An alternative way of explaining why the asymptotic time average of the purity is the same, whether 
using the stabilizer circuit or the general state circuit, is the concept of 2-designs; see [35l [36] 
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100 



Figure 2. Entanglement evolution 
of stabilizer states where N = 10 
and Na = 5. Here a single real- 
ization only. The dramatic jumps 
reflect the fact that stabilizer en- 
tanglement only comes in integer 
values. 



Figure 3. Here averaged over 20 
realizations. One sees that the 
average behaviour is like that of 
general states. 



5. Two phases in approach to typical entanglement 

Theorem 1 tells us that systems undergoing this sort of random interactions will tend 
to become maximally entangled very fast. Numerical studies in addition reveal a finer 
structure in the approach, namely two phases, apparently separated by a well-defined 
and short transition interval. 

(i) A phase in which entanglement is rapidly increasing and spreading through the 
system. 

(ii) A phase in which entanglement has spread through the entire system. 

We identify three ways of defining the moment of time that separates these two 
phases that will be explained in the following: the saturation moment r sat , the cutoff 
moment T cuto fj or the volume scaling moment t vo \. 

5.1. Saturation moment 

Applying the random interaction we make the following 

Observation: There is a moment of saturation of the average entanglement. 

Before this moment the average block entanglement increases essentially linearly in 
the time n. After the transition moment it is however practically constant and nearly 
maximal. Therefore we term the transition time between the two regimes the 'satura- 
tion time' T sa t- There is some degree of freedom in what exact value to assign to this 
time. One could for example specify T sat (e) as the moment that the gradient of the 
average entanglement curve is within some fixed accuracy e to 1. Figure H] shows how 
this moment is reached. 
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Average Entanglement 



Asymptotic (Haar) Average 



x 



Steps 



sat 



Figure 4. Illustrating T sat . 
simulations. 



We find that this behaviour is generic from numerical 



5.2. Cut off moment 

The numerical observation that there appears to be two time-scales involved here, first 
a rapid approach to the asymptotic value and then a slow one, led us to study the 
statistical mathematics literature for tools that quantify this. In fact there has been 
extensive study of such problems, motivated by the fact that a randomisation process, 
such as shuffling cards in a casino, is in practise performed only a finite number of 
times. The question is then, how many shuffles are necessary before one is certain, for 
all practical purposes, that the cards are shuffled. In the setting concerned here, that 
corresponds to asking when we are certain, for practical purposes, that the entanglement 
probability distribution has achieved its asymptotic form. The tool we will use here is 
the so called "cut-off effect", which is exhibited by many Markov Chains [38]. 




The cut-off refers to an abrupt approach to the stationary distribution occurring at 
a certain number of steps taken in the chain. Say we have a Markov chain defined by its 
transition matrix P, and that it converges to a stationary distribution n. Initially the 
total variation distance TV =\\ P — tt \\= sup \ P(E)—it(E) | between the corresponding 
probability distributions is given by TV = 1. After k steps this distance is given by 
TV(k) =|| P k - vr ||. A cut-off occurs, basically, if TV(k) ~ 1 for k = 0, 1,2, ...a and 
thereafter falls quickly such that after a few steps TV(k) ~ 0. As we increase the size 
of the state space, the number of steps during which the abrupt approach takes place 
should decrease compared to a, the number of steps necessary to reach the cut off. Then, 
for very large state spaces, we can say that the randomisation occurs at a steps, some 
function of the size of the state space. 

As a precise definition of a cut off we use [38J. 
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Definition: Let P n , ir n be Markov Chains on sets Xn- Let a n , b n be functions tending 
to infinity, with — tending to zero. Say the chains satisfy an a n , b n cutoff if for some 
starting states x n and all fixed real 9 with k n = \_a n + 9b n \ , then 

||P n fc "-7r n ||^c(0) (22) 

with c(8) a function tending to zero for 9 tending to infinity and to 1 for theta tending 
to minus infinity. 

With this definition it is now possible to study whether there is a sharp cut-off time 
associated with the entanglement probability distribution, a functional on our specific 
Markov process. Indeed, numerically we make the 

Observation: We observe an apparent cut off effect in the entanglement probability 
distribution under the two-particle interaction random process described in this work. 

Figure [5] shows an example of a cut off for general states. The state space has been 
discretised by rounding off entanglement values to the nearest integer. We observe that 
TV ~ 1 for a while and then falls. Finally there is a stage where TV ~ 0. The effect 
becomes more pronounced with increasing N. 




5 10 15 20 25 30 35 

stens. rescaled so curves meet at TV =0.5 



Figure 5. Observe cut-off effect for general states. The total variation distance to the 
asymptotic entanglement probability distribution, TV ~ 1 for some finite time, before 
falling to 0. The fall becomes more dramatic with increasing system size. Note that, 
as is the customary way of representing this effect, we have rescaled the curves to meet 
where TV ~ 0.5. 

Now consider the analogous situation for stabilizer states. From Lemma H] we would 
expect this behaviour to be representative of that of general states. Since stabilizer states 
are efficiently parameterised, using them this will allow us to scale further with N. 
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Observation: We observe an apparent cut off effect in the entanglement probabil- 
ity distribution under the stabilizer two-particle interaction random process described 
in this work. 

How this squeezes up is showed for individual runs, averaged over 1000 realizations, 
in figure El 




20 40 

Steps(rescaled) 

Figure 6. The cut-off effect in the total variation distance (TV) is here studied 
for larger N by employing stabilizer states and tools to efficiently evaluate their 
entanglement [31]. One sees that the effect becomes increasingly pronounced with 
increasing N. We conjecture that the function is a step function in the limit of N — > oo, 
consistent with the normal behaviour of the cut-off effect. 

As stated, the cut off effect is common in classical Markov Chains such as card 
shuffling [38]. It is a testament to the universality of mathematics that applying quantum 
gates at random to a quantum register apparently exhibits the same features, in this 
regard, as applying shuffles to a deck of cards. 

We term this the cut off moment T cuto ff- Using this moment to separate the first 
and second phase has the advantages that it unambiguously gives one moment, and 
this moment corresponds to the point when the entanglement distribution equals, for 
practical purposes, the asymptotic one. 

5.3. The area to volume scaling transition 

We now consider the two-party random process restricted to nearest-neighbours inter- 
actions, as in figured This helps us to relate two pictures of typical entanglement. One 
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Figure 7. N qubits with nearest neighbour interactions. We vary Na at different 
times and ask what the typical entanglement is. 



is the notion that following a randomising process entanglement appears all pervasive 
in the system. In this case one may expect that the entanglement between two blocks 
scales roughly as the size of the smaller part, i.e. we observe a volume scaling. While 
this is the behaviour for generic states it is in strong contrast to the behaviour of pure 
states that are typically appearing in nature such as the ground states of Hamilton 
operators describing systems with short range interactions. Indeed, in such systems the 
block entanglement in ground states has been proven to scale as the boundary surface 
area between the two blocks for a wide variety of systems [I3j EH US, HE]. This indicates 
that ground states of short-range Hamiltonians tend to explore only a tiny fraction of 
the entire Hilbert space as their entanglement properties are indeed rather atypical. 



Numerical Observation: The entanglement scales as the area at small times and 
as the volume of the smaller block at large times. 

This observation motivates the introduction of the transition time from area to vol- 
ume scaling r vo \ and is shown in figure [HJ The result could have been expected since the 
nearest neighbour structure is only respected at small times since at longer times the 
two-qubit unitaries have combined to form global unitaries. To assign an exact value 
to this moment one can specify T vo i(e) as the moment max\E [S(pa)] — E [S(pa, t )] I < e 
where the maximisation is over all partitions. 



6. Multipartite and mixed states 

So far we have considered pure bipartite states. It is known that the unitarily invariant 
measure is not uniquely defined for mixed states however. This can be visualised already 
in the case of one qubit. On one qubit a unitary time evolution corresponds to a rotation 
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Figure 8. N = 40 qubits with nearest neighbour interactions. We vary Na at 
different times and ask what the typical entanglement is. Sufficiently high statistics 
were taken that the statistical error was not visible on this plot. The asymptotic curve 
was calculated using equation 1211 



of the Bloch Sphere. Pure states lie on the surface of the sphere and the unitarily 
invariant measure is an uniform distribution on the surface of the sphere. However 
mixed states can lie anywhere in the ball inside the sphere and there is no longer a 
unique measure as unitary invariance does not provide any restrictions on the radial 
distribution. Mixed measures may be induced by considering environments that allow 
for purifications of mixed states. Then the measure will be obtained by using the uniform 
measure on the purifications and then determine the measure that is obtain by partial 
tracing. Nevertheless some ambiguity remains related to the size of the environment. 
As is evident from the results here, increasing the size of the environment concentrates 
the measure more and more around the maximally mixed state. 

Before evaluating the entanglement between A and B one traces out the 
environment C. This leads to AB to be in a mixed state. In fact, with increasing size 
of the environment the state of AB tends to become more mixed and the entanglement 
between A and B becomes negligible and ultimately disappears. This is a possible 
argument for emerging classicality. To measure entanglement in the mixed state we used 
the logarithmic negativity, En, defined in |47| HEJ H9] and proven to be an entanglement 
monotone in [38] . 

E N (p) = log 2 \\p TA \\i. (23) 
Hence here the entanglement is quantified as E N (tr c (pABc)) One can also note that for 
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mixed states there are classical correlations. The mutual information, S(pa) + S(ps) - 
S{pab)i can be interpreted as the combination of classical and quantum correlations [50 



Observation: The mutual information behaves very similarly to the quantum cor- 
relations (entanglement)in the two particle-interaction random walk simulations. 

This suggest the classical correlations behave similarly to the entanglement in the ran- 
dom two-party process, as in figure [91 
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Figure 9. An example of a numerical simulation showing that the average classical and 
quantum correlations behave similarly. The quantum correlations are quantified using 
the logarithmic negativity. The combination of classical and quantum correlations is 
quantified using the mutual information. Here N — 10, Na — 3, Nb — 3 and Nc = 4. 

The usefulness of the entanglement as a possible macroscopic parameter is 
highlighted in figure \IU\ which shows how the equilibrium values is an attractor point 
regardless of initial state and that there is a flow towards this point. The latter 
observation hints that the average entanglement is a good parameter also in the approach 
to the attracting equilibrium. 

6.1. Simplified multipartite description 

Considering generic entanglement only will hopefully provide simplifications in various 
settings. Here we suggest a way in which it simplifies the multipartite setting. We 
consider the tripartite setting. Let N qubits be shared between three parties of 
size Na, N b and N c , s.t. N A + N B + N c = N and N A < N B < N c . We 
can then consider many types of bipartite cuts. Let E(A\B) signify the (unitarily 



Emergence of typical entanglement in two-party random processes 



24 



Entanglement phase space 



2.5 

n 

> 

< 2 
o 

s 

| 1.5 

3 
to 



0.5 




















s» 


























s ^ 






























- 






' - 




























s 






/ 


s 


/ 


s 


X /* 




/ / / / 






/ y 


/ 


/ 


/ 


/ 


/ 


/ 


f 


/ / 


/ 


/III 


J t 




/ / 


/ 


/ 


/ 


/ 


/ 


/ 


1 


/ / 


/ 


/III 


I t 


\ \ 


/ / 


/ 


/ 


/ 


/ 


/ 


/ 


1 


/ / 


/ 


t t 1 1 


1 1 


1 \ 


/ „ 


/ 


/ 


/ 


/ 


/ 


/ 


! 


/ t 


t 


till 


\ \ 


1 \ 



0.5 



1.5 2 2.5 3 

Entanglement of AB vs C 



3.5 



Figure 10. Entanglement 'Phase space' flow diagram. The arrows point in the average 
flow direction and the length indicates the speed. The entanglement between A and 
B is quantified as EN(trc{pABc)) and the entanglement between AB and C using 
S{pc) We see how different initial states will all tend to the attractor point. The 
horizontal coordinate of this point is given by equation [T] The vertical coordinate by 
E [E N (tr c {pABc))]- Here N = 10, N A = 3, N B = 3 and N c = 4. The region in the 
top right corner is empty as such states are not physically realizable. 



invariant) entanglement average of the entanglement across the A\B cut after C has 
been traced out, and let E(A + B\C) signify the corresponding quantity across the 
A + B\\C cut. The set Q of all possible bipartite entanglement averages is then 
n = {E(A\B),E{A\C),E{B\C),E{A + B\C),E{A + C\B),E(B + C\A)}. Let these 
values specify 'the tripartite entanglement'. Now for an arbitrary state, it is not the 
case that Q is uniquely determined by E(A\B) together with E(A + B\C). However we 
make the following claim. 

Conjecture: Q is uniquely determined by E(A\B) together with E(A + B\C) 
Support for conjecture: Firstly note that if A^4, Nb and Nc are specified then Q is 
uniquely determined. Then we need the following two statements to be true to prove 
the claim: 

1. Na + Nb and Nq are uniquely specified by E(A + B\C). This is true if the Von 
Neumann entropy of entanglement increases monotonically when a qubit is given from 
the larger of the two parties to the smaller. We are concerned with the limit of large N, 
where equation [T] implies that Nc = \E(A + B\C)] assuming Nc < Na + Nb 

2. For the given iV<7 and N, N^ and iV^ are uniquely determined by E(A\B). 
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Again one should be able to use a monotonicity argument here, although it will be more 
difficult due to the lack of a neat closed form of E(A\B) = E [E N (trc{pABc))}- It seems 
reasonable to expect it to be monotonous though since since Nc is fixed. 

This idea could presumably be extended to more than three parties and may lead 
to a description that is highly useful by virtue of requiring very few parameters. 

7. Discussion and Conclusion 

The entanglement evolution during random two-qubit interactions was studied in some 
depth and we have provided a detailed proof of the result presented in P that the 
average entanglement approaches the unitarily invariant value within 0(N 3 ) steps. The 
essential idea of the proof is a map from the evolution in state space onto one on a 
much smaller space that tracks the evolution of the purity of the reduced subsystem. 
We then proved that the process can be simulated efficiently on a classical computer 
using stabilizer states. 

We found through numerical studies that there are two phases in the approach: first 
a phase during which entanglement is rapidly spreading through the system, and then 
a phase where the entanglement is suffused throughout the system. Three moments of 
time that could be used to define the partition between these phases were introduced 
and discussed. Firstly the saturation of the average r sat was considered followed by 
the cut off moment T cuto ff. Then we noted that if one restricts the interactions to be 
between nearest neighbours, the entanglement initially scales as the area of the smaller 
region and in the second phase as that of the volume. This led us to introduce r voh 
the moment the entanglement is typically volume scaling. Of these perhaps the cut off 
moment Tcutoff is the most attractive choice, since it gives an unambiguous single time 
that corresponds to the moment that the entanglement probability distribution is equal 
to, for practical purposes, that of the unbiased distribution of pure states. 

The results support the relevance of generic entanglement as we show it 
can be generated efficiently from two-qubit gates. Therefore protocols relying on 
typical/generic entanglement, like [121 [26], [221 EH] gain relevance [TJ. 

The above results may be extended in various directions that will be described 
briefly here. 

Multi-particle entanglement measures - It would be interesting to extend the above 
results to further investigate properties of typical multi-partite entanglement. For 
entanglement measures based on average purities [2] this is possible with the results 
established here. Other measures such as the entanglement of formation may also be 
treated but require extensions of the approach presented here that go beyond the scope 
of this paper. 

Markov Process Quantum Monte Carlo Methods - Our results possess another 
interpretation that is of interest for the numerical study of quantum-many-body systems. 
One numerical approach to classical spin systems is to evolve spin configurations 
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randomly according to the Metropolis rule, i.e. always accepting moves that decrease 
energy and accepting moves that increase energy with a probability proportional to 
exp{— AE/kT}. This reproduces the correct thermal average. One may of course 
consider a similar approach in the quantum setting applying random two-qubit gates 
to progress the state. Our present results fall into this category but apply for infinite 
temperatures as we draw our unitaries from the invariant Haar measure. However, we 
may adapt our analysis to the finite temperature regime employing stabilizer states at 
the expense of having to analyze a more complicated Markov process. Basic ideas of 
our approach carry over and open the possibility for rigorous statements concerning the 
convergence rate of such a Markov Process Quantum Monte Carlo approach. 

Continuous Variables - These question are made additionally complicated in the 
continuous variable setting through the lack of a Haar measure in the setting of non- 
compact groups. However in [53j an approach to proceed is introduced. There one 
notes that it is reasonable to assume that the maximum energy of the global pure state 
is finite. This tames the non-compactness and one can define a way to pick states at 
random. Reference [531 El] gives an explicit method to achieve that for Gaussian states, 
which probably will take on the role analogous to stabilizer states in the general setting. 
This opens up the possibility of studying the questions dealt with here in the continuous 
setting. 

Two phases - The relation between the three moments of time that can be used to 
separate the two phases we observe should be further investigated. We believe analytical 
tools for studying the cut-off effect are sufficiently developed to undertake an analysis 
with the aim of proving relations between r voh T cuto ff, an d T sat respectively. 

Relation to other work - It would be interesting to investigate how our results relate 
to existing work on Spin-gases [55] which are similarly semi-quantal systems. We also 
hope to relate this line of enquiry to work touching on what the typical entanglement of 
the universe is [571 156] . The main difference to the approach here would be to consider 
closed systems. 

Experimental studies - Optical lattices appear to provide a suitable experimental 
setting to test the results obtained here. There is also hope of using Bose-Einstein 
condensates or linear optics to study the continuous variable case. 
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8. Appendix A: Uniform (Haar) measure on the unitary group 

We here briefly introduce the often used uniform measure on pure states, sometimes 
called the unitarily invariant measure. This is a particular instance of a Haar measure, 
which can be viewed as a generalization of the 'flat distribution'. A flat probability 
distribution is the one reflecting no bias with respect to any object. 

Such a distribution on a set of objects is invariant under permutations of the said 
objects. Therefore if there is a group of transformations associated with the set of 
elements, the distribution on those elements should be invariant under application of 
elements of the group. The simplest non-trivial example is probably a coin. Here 
there are two group elements, the identity and the flip. Only the unbiased probability 
distribution P(Head)=l/2 and P (Tail) =1/2 is invariant under those transformations. 

Bearing this in mind, consider picking pure general states at random. The unbiased 
distribution on pure states is here required to be invariant under unitary transforms, 
i.e. P(|\&)) = P(U\^f)), where it is implicit that we are in a continuous setting. This 
requirement uniquely defines the distribution. For a single qubit this can be nicely 
visualised as a uniformly dense distribution on the Bloch sphere, see figure HU 



Z 




Figure 11. Bloch sphere: the unitarily invariant distribution corresponds to a uniform 
distribution over this sphere. 

The normal method to pick pure states from this distribution is to fix an arbitrary 
pure state and apply a unitary picked at random from the associated measure on unitary 
matrices. See for example [52J for the explicit procedure. 

9. Appendix B: Randomizing properties of Haar measure 

The following lemma shows that Requirement [1] is satisfied by our example with Haar 
measure. 
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Lemma 5. Suppose F = F(A, B) is a bi-linear function of one-qubit operators A, B, 
that a a , cr& are two Pauli matrices, and that I is randomly drawn from Haar measure on 
17(2). Then 

[ F(I,I), o = 6 = 0; 

E [F{Ta a T\ Ta b Ti)} = 1 § E we{ ^,, } F(a w , a w ), a = b^0 

! 0, otherwise. 

To prove it, we need an intermediate result. 

Lemma 6. Assume that a a , a ^ 0, is a Pauli operator and T is a random unitary 
drawn from Haar measure on U{2). Then 

Ta a T^ = r x a x + r y a y + r z a z , 

where (r x ,r y ,r z ) is a random vector whose distribution is invariant under permutations. 
Moreover, E [?w] = 5 u y/3 for all u, u' G {x, y, z}. 

Proof. Ta a T^ is a 2 x 2 Hermitian matrix. Therefore, there exist real numbers (in fact 
random variables) r u = Tr(cr u Tcr a T^)/2 (u G {0, x, y, z}) such that 

Ta w T ] = r I + r x a x + r y a y + r z a z . 

Tr (Ta a T^) = Tr(a a ) = implies r = 0. Before we continue, let us state a simple fact 
we will use repeatedly. 

Claim 2 (Conjugation trick). For any unitary Q G U(2), Ta a T^ and QTa a T^Q^ = 
QTa a (QTY have the same probability distribution. Therefore, the distribution of 
(r x ,r y ,r z ) is the same as that of (r' x ,r' ,r' z ) , where 

r' u = l - Tr (a u QTa w TiQi) = X - Tr((Qa u Q^)Ta w T^). (24) 

In fact, this follows directly from the invariance property of Haar measure. Since 
T and QT have the same probability distribution, so do Ta a T^ and QTo~ a (QT)^ (which 
are the same deterministic function of Q and QT, respectively) and the same holds for 
(r x ,r y ,r z ) is the same as that of (r' x ,r' y ,r' z ). 

We now apply the trick as follows. 

(i) Take Q = H (the Hadamard matrix). Then Qo x Q^ = o~ z , Qcr z Q^ = a x and 
Qo~ y Q} = o y . This implies that 2r' x = Tr((Qa x Q^)Ta a T^) = Tr(a y Ta w T^) = 2r y , 
2r' y = Tr((Qa y Q^)Ta a T^) = Tr(a z Ta a T r ) = 2r x and r' z = r z . Hence {r' x ,r' y ,r' z ) = 
(r z ,r y ,r x ) and (r x ,r y ,r z ) have the same distribution. 

(ii) Now take Q = |0)(0| — i\ Qcr x Q^ = a y , Qa y Q} = a x and Qa z Q^ = a z . 
It follows from the above reasoning that (r y ,r x ,r z ) and (r x ,r y ,r z ) have the same 
distribution. 
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(iii) Take Q = a z this time. Then Qa x Q^ = —o~ x , Qa y Q^ = —cr y , Qa z Q^ = a z , so 
(—r x ,—r y ,r z ) and {r x ,r y ,r z ) have the same distribution. Similarly Similarly, we 
can take Q = a y or Q = a x to show that (—r x , r y , —r z ) and (r x , —r y , —r z ) also have 
the same distribution. 

The first two items show that the distribution of (r x , r y , r z ) is invariant by transposition 
of the x and z coordinates and of the x and y coordinates. It follows that the distribution 
is also invariant under transposition of the y and z coordinates (which is a composition 
of a xz transposition followed by a xy transposition and another xz transposition). 
Since any permutation is a composition of transpositions, we have shown that the the 
distribution of {r x ,r y ,r z ) is invariant under permutations of the coordinates. Moreover, 
it also follows that 

1 = = ^ [Tr [(T^Tt) 2 ]] = E [r 2 + r 2 + r 2 ] = 3E [r 2 ] for all u G {*, y, z}. 

Thus it only remains to show that E [r u ?v] = if u ^ u' . To this end, we use item 3. 
If for instance u = x, v! = z we recall that (—r x , —r y , r z ) and (r x , r y , r z ) have the same 
distribution, hence r x r z and — r x r z also have the same distribution, which implies our 
claim. The other cases follow similarly. □ 

We can now prove Lemma [5j 

Proof. First assume that a ^ b. Without loss of generality, assume that b ^ 0. We 
apply an idea based on the Conjugation Trick from the previous proof. There exists 
c G {x, y, z} such that o a and cr c commute and o~b anti-commutes with a c . As U and Ua c 
have the same distribution, (Ua c )a a (Ua c )^ = Ua a U^ and {Uo- c )a b {Ua c )^ = —UabU^, we 
have 

E [F(Ua a U\ Ua b U^)] = E [F{(Uo c )a a {Uo c )\ (Ua c )a b (Ua c Y)] 

= -¥.[F(Ua a U\Ua b U ] )] , 

thus the expected value is 0. Now if a = b = 0, the result is trivial. If a = b ^ 0, we can 
write Uo~ a U^ = r x o~ x + r y o~ y + r z o~ z as in Lemma El and by bi-linearity 

[r w >r w r>]F(a w r,a w >r). (25) 

w',w" 

Applying Lemma El finishes the proof. □ 

10. Appendix C: Stabilizer states 

Stabilizer states are a discrete subset of general quantum states, which can be 
described by a number of parameters scaling polynomially with the number of qubits in 
the state [281 [291 [8]. 
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A stabilizer operator on N qubits is a tensor product of operators taken from the 
set of Pauli operators 

and the identity /. An example for N = 3 would be the operator g — oi Cg) I Cg) sigma^. 
A set G = {gi,...,gx} of K mutually commuting stabiliser operators that are 
independent, i.e. Yli=x dT = I exactly if all Sj are even, is called a generator set. For 
K = N a generator set G uniquely determines a single state that satisfies gk\$) = \4>) 
for all /c = 1, . . . , N. Such a generating set generates the stabilizer group. Each unique 
such group in turn defines a unique stabilizer state. 

For example the GHZ state 1 000) + | 111) is defined by the generator set {a\ ® o\ <8> 
(Ji, <T ® cr 3 ® cr 3 , cr 3 (g) cr 3 ® cr ). 

A key observation that is useful for the considerations here is the fact that the 
bipartite entanglement of a stabilizer state, i.e. the entanglement across any bipartite 
split, takes only integer values [311 [32] . 

Finally we note the fact that in order for the stabilizer state to be non-trivial it is 
necessary and sufficient that the elements of the stabilizer group (a) commute, and (b) 
are not equal to -I [8] . 



